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■ We consider the 'one-shot frame synchronization problem' where a decoder wants to locate a sync pattern 
at the output of a channel on the basis of sequential observations. We assume that the sync pattern of length N 
starts being emitted at a random time within some interval of size A, that characterizes the asynchronism level 
between the transmitter and the receiver. We show that a sequential decoder can optimally locate the sync pattern, 
i.e., exactly, without delay, and with probability approaching one as N — » oo, if and only if the asynchronism 
level grows as 0(e ), with a below the synchronization threshold, a constant that admits a simple expression 
depending on the channel. This constant is the same as the one that characterizes the limit for reliable asynchronous 
communication, as was recently reported by the authors. If a exceeds the synchronization threshold, any decoder, 
sequential or non-sequential, locates the sync pattern with an error that tends to one as N — > oo. Hence, a sequential 
decoder can locate a sync pattern as well as the (non-sequential) maximum likelihood decoder that operates on the 
basis of output sequences of maximum length A + N — 1, but with much fewer observations. 

Index Terms 

Quickest detection, frame synchronization, sequential analysis 

^ ' I. Introduction 

Frame synchronization refers to the problem of locating a sync pattern periodically embedded into data 
O \ and received over a channel (see, e.g., flU, J3]|, J6]|, jH). In flU Massey considered the situation of binary 
data transmitted across a white Gaussian noise channel. He showed that, given received data of fixed 
size which the sync pattern is known to belong to, the maximum likelihood rule consists of selecting the 
. £^ \ location that maximizes the sum of the correlation and a correction term. 

We are interested in the situation where the receiver wants to locate the sync pattern on the basis of 

■ sequential observations, which Massey refers to as the 'one-shot' frame synchronization problem in [Q. 
Surprisingly, this setting has received much less attention than the fixed length frame setting. In particular it 
seems that this problem hasn't been given a precise formulation yet. In this note we propose a formulation 
where the decoder has to locate the sync pattern exactly and without delay, with the foreknowledge that 
the pattern is sent within a certain time interval that characterizes the level of asynchronism between the 
transmitter and the receiver. Our main result is the asymptotic characterization of the largest asynchronism 
level with respect to the size of the sync pattern for which a decoder can correctly perform with arbitrarily 
high probability. 

II. Problem formulation and result 

We consider discrete-time communication over a discrete memoryless channel characterized by its finite 
input and output alphabets X and y, respectively, transition probability matrix Q(y\x), for all y £ y and 
x £ X, and 'noise' symbol * £ X\\ 
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Fig. 1. Time representation of what is sent (upper arrow) and what is received (lower arrow). The V represents the 'noise' symbol. At 
time v the sync pattern starts being sent and is detected at time r. 



The sync pattern s consists of iV > 1 symbols from X — possibly also the * symbol. The transmission 
of the sync pattern starts at a random time v, uniformly distributed in [1, 2, . . . , A], where the integer A > 1 
characterizes the asynchronism level between the transmitter and the receiver. 

We assume that the receiver knows A but not v. Before and after the transmission of the sync pattern, 
i.e., before time v and after time v + iV — 1, the receiver observes noise. Specifically, conditioned on 
the value of v, the receiver observes independent symbols Y\, Y 2 , . . . distributed as follows. If i < v — 1 
or i > v + N, the distribution is Q(-\*). At any time % E [v, v + 1, . . . , u + N — 1] the distribution is 
Q{-\si-v+i), where s n denotes the nth symbol of s N . 

To identify the instant when the sync pattern starts being emitted, the receiver uses a sequential decoder 
in the form of a stopping time r with respect to the output sequence Fi, Y 2 , . . u If T = n the receiver 
declares that the sync pattern started being sent at time n — N + 1 (see Fig. [T]). The associated error 
probability is defined as 

P(r ^ u + N- 1) . 
We now define the synchronization threshold. 

Definition. An asynchronism exponent a is achievable if there exists a sequence of pairs sync pat- 
tern/decoder {(s N , tn)}n>i such that s N and tn operate under asynchronism level A = e aN , and so 
that 

F{r N ^v-N+1) . 

The synchronization threshold, denoted a(Q), is the supremum of the set of achievable asynchronism 
exponents. 

Our main result lies in the following theorem. 

Theorem. The synchronization threshold as defined above is given by 

a{Q) = max D{Q{-\x)\\Q{-\*)) , 

X 

where D(Q(-\x)\\Q(-\k)) is the divergence (Kullback-Leibler distance) between Q(-\x) and Q(-\*). Fur- 
thermore, if the asynchronism exponent is above the synchronization threshold, a maximum likelihood 
decoder that is revealed the maximum length sequence of size A + N — l makes an error with a probability 
that tends to one as N — > oo. 

A direct consequence of the theorem is that a sequential decoder can (asymptotically) locate the sync 
pattern as well as the optimal maximum likelihood decoder that operates on a non-sequential basis, having 
access to sequences of maximum size A + N — lj| 

Note that the synchronization threshold is the same as the one in [7), which is defined as the largest 
asynchronism level for which reliable communication can be achieved over point-to-point asynchronous 

2 Recall that a stopping time r is an integer-valued random variable with respect to a sequence of random variables {Yi}"^ so that the 
event {r = n}, conditioned on {Y}™ =1 , is independent of {Yi}°Z n+1 for all n > 1. 

3 Note that if the receiver has no foreknowledge on A, i.e., if A can a priori be arbitrarily large, the problem is ill-posed: for any decoder, 
the probability of miss-location of the sync pattern can be made arbitrarily large for A large enough. 
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channels. This should not come as a surprise since the limit of asynchronous communication is obtained in 
the zero rate regime where decoding errors are mainly due to a miss location of the transmitted message. 

We now prove the theorem by first presenting the direct part and then its converse. Recall that a type, 
or empirical distribution, induced by a sequence z N e Z N is the probability P on Z where P(a), a E Z, 
is equal to the number of occurrences of a in z N divided by N. 

Proof of achiev ability: We show that a suitable sync pattern together with the sequential typical- 
ity decode!] achieves an asynchronism exponent arbitrarily close to the synchronization threshold. The 
intuition is as follows. Let x be a 'maximally divergent symbol,' i.e., so that 

D{Q{.\x)\\Q{-\*)) = <*{Q) ■ 

Suppose the sync pattern consists of iV repetitions of x. If we use the sequential typicality decoding we 
already have almost all the properties we need. Indeed, if a < a(Q), with negligible probability the noise 
generates a block of N output symbols that is jointly typical with the sync pattern. Similarly, the block of 
output symbols generated by the sync pattern is jointly typical with the sync pattern with high probability. 
The only problem occurs when a block of N output symbols is generated partly by noise and partly by 
the sync pattern. Indeed, consider for instance the block of iV output symbols from time v — 1 up to 
v + iV — 2. These symbols are all generated according to the sync pattern, except for the first. Hence, 
whenever the decoder observes this portion of symbols, it makes an error with constant probability. The 
argument extends to any fixed length shift. 

The reason that the decoder is unable to locate the sync pattern exactly is that a constant sync pattern 
has the undesirable property that when it is shifted to the right, it still looks almost the same. Therefore, 
to prove the direct part of the theorem, we consider a sync pattern mainly composed of x's, but with a few 
*'s mixed irjf] to make sure that shifts of the sync pattern look sufficiently different from the original sync 
pattern. This allows the decoder to identify the sync pattern exactly, with no delay, and with probability 
tending to one as iV goes to infinity, for any asynchronism exponent less than a(Q). We formalize this 
below. 

Suppose that, for any arbitrarily large K, we can construct a sequence of patterns {s N } of increasing 
lengths such that each s N — si, s 2 , ■ ■ ■ , Sjv satisfies the following two properties: 
I. all s^s are equal to x, except for a fraction smaller than l/K that are equal to *; 
H. the Hamming distance between the pattern and any of its shifts of the form 

*,*,...,*, s u s 2 ,..., sjv-i i e [1,2, . . .,N) 

i times 

is linear in N. 

Now let A = e N ( a (®' € \ for some e > 0, and consider using patterns with the properties I and II in 
conjunction with the sequential typicality decoder r = rjy. 

By HI Lemma 2.6, p. 32] and property I, the probability that iV output symbols entirely generated by 
noise are typical with the sync pattern is upper bounded by exp (— N(l — l/K)(a(Q) — 5)), where 5 > 
goes to zero as the typicality constant // goes to zeroJl Hence, by the union bound 

P ({r < u} U {T > V + 2N - 1}) < e -N(e-5-(a-S)/K) 

which tends to zero for fi small enough and K sufficiently large0 

4 The sequential typicality decoder operates as follows. At time n, it computes the empirical distribution P induced by the sync pattern 
and the previous iV output symbols y n ~N+i, Un-N+2, ■ • ■ , y n - If this distribution is close enough to P, i.e., if \P(x, y) — P(x, y)\ < fj, for 
all x, y, the decoder stops, and declares n — N + 1 as the time the sync pattern started being emitted. Otherwise it moves one step ahead 
and repeats the procedure. Throughout the argument we assume that fi is a negligible strictly positive quantity. 

5 Indeed, any symbol different than x can be used. 

6 See footnote El 

7 If a(Q) — oo the upper bound is zero if /x is small enough. 



Fig. 2. Parsing of the entire received sequence of size A + N — 1 into r blocks y' tl \y ( - t2 \ ■ ■ ■ ,j/ < '"' 1 of length N, where the ith block 
starts at time U. 



If the N observed symbols are partly generated by noise and partly by the sync pattern, by property 
II, the Chernoff bound, and the union bound we obtain 

P (r G [v, v + 1, . . . , v + N - 2]) < (N - l)e~ niN) 

which vanishes as N tends to infinity. 
We then deduce that 

P(r = v + N - 1) -> 1 

as iV — > oo. 

To conclude we give an explicit construction of a sequence of sync pattern satisfying the properties I 
and II above. To that aim we use maximal length shift register sequences (see, e.g.,j2]|). Actually, for our 
purpose, the only property we use from such binary sequences of length I = 2 m — 1, m G [1,2,.. .), is 
that they are of Hamming distance (I + l)/2 from any of their circular shifts. 

To construct the sync pattern we start by setting s« = x for all 2 ^ mod K where, without loss of 
generality, K is chosen to satisfy [j^J = 2 m — 1 for some m G [1, 2, . . .)J| With this choice, property I is 
already satisfied. To specify the [^J positions i with i = mod K, pick a maximal length shift register 
sequence m 1 ,m 2 , . . . , m^N^, and set sj K — x if rrij = and Sj K = * if rrij = 1, for any integer j < |_-^J. 
It can be readily verified, using the circular shift property of maximal length shift register sequences, that 
this construction yields patterns that satisfy property II. 

■ 

Proof of the converse: We assume that A = e Na with 

a > max D(Q(-\x)\\Q (■]*)) 

X 

and show that the (optimal) maximum likelihood decoder that operates on the basis of sequences of 
maximum length A + N — 1 yields a probability of error going to one as N tends to infinity. 

We assume that the sync pattern s N is composed of N identical symbols s G X. The case with multiple 
symbols is obtained by a straightforward extension. Suppose the maximum likelihood decoder not only 
is revealed the complete sequence 

2/1,2/2, • • .,Va+n-i , 

but also knows that the sync pattern was sent in one of the r distinct block periods of duration N, where 
r denotes the integer part of (A + N — 1)/N, as shown in Fig. [2l 

Assuming Q(y\*) > for all y G 3^0 straightforward algebra shows that the decoder outputs the time 
£j, i G [1, 2, . . . , r], that maximizes 



f(y 



Q(y {t 



i)\ s N 



Q(yM\*) 



Note that f(y^) depends only on the type of the sequence y^ since s N is the repetition of a single 
symbol. For conciseness, from now on we adopt the notation Q s (y^') instead of Q(y^\s N ) and Q^y^ 1 ) 
instead of Q(y^\-k). 



8 We use [zj to denote the largest integer smaller than x. 

9 If Q(y\-k) = for some y G y we have a(Q) — oo, and there is nothing to prove. 
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Let Q s ± e denote the set of types (induced by sequences y N ) that are e > close to Q s with respect 
to the L 1 norm, and let E\ denote the event that the type of the z/th block (corresponding to the sync 
transmission period) is not in Q s ± e . It follows that 

P(£i) < e~ Ne (1) 

for some e = e(e ) > 00 Let Q s = argmaxp g Q s±eo /(P)o where with a slight abuse of notation f(P) is 
used to denote f(y N ) for any sequence y N having type P. Now consider the event E 2 where the number 
of blocks generated by Q+ that have type Q s is smaller than 

1 -jV(-D(Q s ||Q*)-oO 
2(N+1)\ X \ 

Using [1, Lemma 2.6, p. 32], the expected number of blocks generated by Q+ that have type Q s is lower 
bounded as 



E (number of type Q s blocks generated from Q*) > ^ — ^\ x f ND{ - Q ^ Q *\r - 1) 

> poly(iV)e- Ar ^^l' Q *)- a ) , 

and using Chebyshev's inequality we get 

¥(E 2 ) < poly(A^)e- Ar(a - D(Q " s|IQ * )) (2) 

where poly(iV) denotes a term that increases or decreases no faster than polynomially in N. 

Finally consider the event ^3 defined as the complement of EiL)E 2 . Given that ^3 happens, the decoder 
sees at least 

1 r -N(D«5.\\Q*)-a) 



2(N+ 1)1*1 

time slots that are at least as probable as the correct u\h. Hence, the probability of correct detection given 
that the event E 3 happens is upper bounded as 

P(corr.dec|£ 3 ) < poly(iV)e- 7V(a - D{Q " s|IQ * )) . (3) 

We deduce from ©, ©, and © that the probability of correct decoding is upper bounded as 

3 

P(corr. dec.) = P(corr.dec| J B i )P( J E J ) 

i=l 

< P(£i) + F(E 2 ) + P(corr.dec|E 3 ) 

< ( e -^ + e- N(a ~ D ^ Q *»)po\y(N) . 

Therefore if 

a > D{Q S \\Q„) , 

the probability of successful detection goes to zero as N tends to infinity. Since D(Q S \\Q+) tends to 
D(Qs\\Q*) as 60 I by continuity of £>(-||Q*)@ 

the result follows by maximizing D(Q s \\Q(-\*)) over 

s e X. 

■ 

10 Here we implicitly assume that TV is large enough so that the set of types Q s ± eo is nonempty. 
"Note that Q s may not be equal to Q s . 

12 We may assume D(-||Q*) is continuous because otherwise a(Q) = oo and there is nothing to prove. 
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